IN THE CLAIMS 



1 . (Currently Amended) A method, comprising: 

in response to a text sentence comprising a plurality of words, each having a part of 

speech (POS) tag, generating a POS sequence based on the POS tag of each of 

the plurality of words; 
detecting a prosodic phrase break from an output phrase break t hrough a recurrent 

neural network (RNN), based on the POS sequence, through iterative 

operations including 

retrieving a particular POS tag from the tag sequence, 

inputting a first phrase break to a first input of the RNN network, 

inputting a first POS tag from the tag sequence to a second input of the RNN 

network, wherein the first POS tag is associated with the first phrase 

break, 

inputting a second phrase break to a third input of the RNN network, 
inputting a second POS tag from the tag sequence to a fourth input of the RNN 

network, wherein the second POS tag is associated with the second 

phrase break, and wherein the second POS tag is next to the first POS 

tag in the tag sequence, and 
inputting the particular POS tag to a fifth input of the RNN network, 
inputting tho POS tag to th e RNN n e twork, including 

initializing and inputting a first initial phras e br e ak to a first input of th e 
RNN n e twork, 

initializing and inputting a first initial POS tag to a second input of th e 
RNN network, 

initializing and inputting a s e cond initial phras e break to a third input of 
tho RNN network, 
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inputting th e first POS tag of th e tag sequ e nce to a fourth input of th e 

RNN n e twork, and 
inputting th e s e cond POS tag of the tag s e quence to a fifth input of th e 

RNN network, 

generating aa-the output phrase break associated with the particular POS tag, 

from the RNN network, 
retrieving a next POS tag from the tag sequence, and 

repeating the above iterative operations until there are no more POS tags to be 
processed in the tag sequence; and 
generating a prosodic phrases boundary based on the prosodic phrase break. 

2. (Original) The method of claim 1 , further comprising: 

assigning a POS tag for each of the plurality of words of the sentence; and 
classifying the POS tag for each of the plurality of words to a predetermined class. 

3. (Original) The method of claim 2, wherein the classification of the POS tag comprises 

adjective, adverb, noun, verb, and number. 

4. (Original) The method of claim 3, wherein the classification of the POS tag further 

comprises quantifier, preposition, conjunction, idiom, and punctuation. 

5. (Original) The method of claim 1, further comprising segmenting the sentence into the 

plurality of words. 

6. (Canceled) 

7. (Canceled) 
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8. (Currently Amended) The method of claim 1 , further comprising: 

inputting the second-initial phrase break to the first input of the RNN network; 
inputting the fest -second P OS tag of th e tag s e qu e nc e t o the second input of the RNN 
network; 

inputting the output phrase break, previously generated through the RNN network, to 

the third input of the RNN network; 
inputting the seeend -particular POS tag of the tag s e qu e nc e t o the fourth input of the 

RNN network; 

inputting the-anext POS tag following the particular POS tag fr em-in the tag sequence 

to the fifth input of the RNN network; and 
generating a next phrase break associated with the next POS tag through the RNN 

network. 

9. (Original) The method of claim 1 , wherein the phrase break is generated based on the 

previously inputted POS tags and previously generated phrase breaks, through the 
RNN network. 

10. (Currently Amended) A method, comprising: 

retrieving a particular POS tag from a.tag sequence; 
inputting the particular POS tag to an RNN network, including 

inputting a first phrase break to a first input of the RNN network, 
inputting a first POS tag from the tag sequence to a second input of the RNN 
network, wherein the first POS tag is associated with the first phrase 
break, 

inputting a second phrase break to a third input of the RNN network. 
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inputting a second POS tag from the tag sequence to a fourth input of the RNN 
network, wherein the second POS tag is associated with the second 
phrase break, and wherein the second POS tag is next to the first POS 
tag in the tag sequence, and 

inputting the particular POS tag to a fifth input of the RNN network; 

initializing and inputting a first initial phras e br e ak to a first input of th e RNN 
n e twork, 

initializing and inputting a first initial POS tag to a s e cond input of th e RNN 
network, 

initializing and inputting a s e cond initial phrase br e ak to a third input of th e 

RNN n e twork, and 
inputting th e first POS tag of th e tag sequence to a fourth input of th e RNN 

n e twork; and 

inputting th e s e cond POS tag of th e tag s e qu e nc e to a fifth input of th e RNN 
n e twork; 

generating an output phrase break associated with the particular POS tag, from the 

RNN network; 
retrieving a next POS tag from the tag sequence; and 

repeating inputting the particular POS tag, generating an output phrase break, and 

retrieving a next POS tag, until there are no more POS tags to be processed in 
the tag sequence. 

11. (Canceled) 

12. (Currently Amended) The method of claim 10, further comprising: 

inputting the second iaitial-phrase break to the first input of the RNN network; 
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inputting the fifsi -second P OS tag of tho tag s e qu e nc e to the second input of the RNN 
network; 

inputting the output phrase break, previously generated through the RNN network, to 

the third input of the RNN network; 
inputting the seeend -particular P OS tag of th e tag o e quonco to the fourth input of the 

RNN network; 

inputting the-anext POS tag following the particular POS tag fe em-in the tag sequence 

to the fifth input of the RNN network; and 
generating a next phrase break associated with the next POS tag through the RNN 

network. 

13. (Currently Amended) The method of claim 10, wherein the first p hrase break is 

generated based on the previously inputted POS tags and previously generated phrase 
breaks, through the RNN network. 

14. (Currently Amended) An apparatus, comprising: 

an interface to receive a text sentence comprising a plurality of words, each of the 

plurality of words having a part of speech (POS) tag; 
a text processing unit to generate a POS sequence based on the POS tag of each of the 

plurality of words; 

an recurrent neural network (RNN) to detect a prosodic phrase brea k from an output 
phrase break, based on the POS sequence and generating a prosodic phrases 
boundary based on the prosodic phrase break, wherein the RNN network 
comprises 

an input layer for receiving input data, including 

a first input to receive a first «itial-phrase break, 
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a second input to receive a first initial POS ta g from the tag sequence, 
wherein the first POS tag is associated with the first phrase 
break, 

a third input to receive a second iftitiaJ-phrase break, 

a fourth input to receive a first -second POS tag frome f the tag 

sequence, wherein the second POS tag is associated with the 
second phrase break, and 
a fifth input to receive a seeend -third P OS tag of the tag sequence, 
a hidden layer to perform a prosodic phrase break detection, and 
an output layer to generate athe prosodio output p hrase break; and 
a speech processing unit to perform speech analysis on the prosodic phrase breaks and 
generating an output speech based on the prosodic phrase breaks. 

15. (Original) The apparatus of claim 14, wherein the text processing unit assigns the POS 

tag for each of the plurality of words of the sentence, and classifies the POS tag for 
each of the plurality of words to a predetermined class. 

16. (Canceled) 

17. (Currently Amended) The apparatus of claim 14, wherein: 

the first input receives the second initial phrase break; 
the second input receives the firs ^second POS ta g of the tag sequence ; 
the third input receives the output phrase break, previously generated; 
the fourth input receives the seeeftd -third POS ta g of th e tag s e qu e nc e; and 
the fifth input receives the-anext POS tag following the third POS tag in# em- the tag 
sequence. 
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18. 



(Currently Amended) The apparatus of claim 14, wherein the first p hrase break is 
generated based on the previously inputted POS tags and previously generated phrase 
breaks, through the RNN network. 



19. (Currently Amended) A machine-readable medium having stored thereon executable 
code which causes a machine to perform a method, the method comprising: 
in response to a text sentence comprising a plurality of words, each having a part of 

speech (POS) tag, generating a POS sequence based on the POS tag of each of 

the plurality of words; 
detecting a prosodic phrase break from an output phrase break through a recurrent 

neural network (RNN), based on the POS sequence, through iterative 

operations including 

retrieving a particular POS tag from the tag sequence, 
inputting the particular POS tag to the RNN network, including 

inputting a first phrase break to a first input of the RNN network, 
inputting a first POS tag from the tag sequence to a second input of the 
RNN network, wherein the first POS tag is associated with the 
first phrase break, 
inputting a second phrase break to a third input of the RNN network, 
inputting a second POS tag from the tag sequence to a fourth input of 
the RNN network, wherein the second POS tag is associated 
with the second phrase break, and wherein the second POS tag 
is next to the first POS tag in the tag sequence, and 
inputting the particular POS tag to a fifth input of the RNN network, 
initializing and inputting a first initial phras e br e ak to a first input of the 
RNN n e twork, 
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initializing and inputting a first initial POS tag to a second input of tho 
RNN network, 

initializing and inputting a s e cond initial phrase brook to a third input of 

tho RNN n e twork, 
inputting th e first POS tag of tho tag s e qu e nce to a fourth input of th e 

RNN n e twork, and 
inputting tho s e cond POS tag of th e tag s e quenc e to a fifth input of th e 

RNN n e twork, 

generating thean output phrase break associated with the particular POS tag, 

from the RNN network, 
retrieving a next POS tag from the tag sequence, and 

repeating the above iterative operations until there are no more POS tags to be 
processed in the tag sequence; and 
generating a prosodic phrases boundary based on the prosodic phrase break. 

20. (Original) The machine-readable medium of claim 19, wherein the method further 

comprises: 

assigning a POS tag for each of the plurality of words of the sentence; and 
classifying the POS tag for each of the plurality of words to a predetermined class. 

21. (Canceled) 

22. (Currently Amended) The machine-readable medium of claim 19, wherein the method 

further comprises: 

inputting the second iakial-phrase break to the first input of the RNN network; 
inputting the fest -second POS tag of tho tag sequence to the second input of the RNN 
network; 



Application Serial No. 10/052,147 



Atty. Docket No. 42P 10423 



inputting the output phrase break, previously generated through the RNN network, to 

the third input of the RNN network; 
inputting the particular s eeeftd POS tag of th e tag sequ e nce to the fourth input of the 

RNN network; 

inputting the-anext POS tag following the particular POS tag infe ea* the tag sequence 

to the fifth input of the RNN network; and 
generating a next phrase break associated with the next POS tag through the RNN 

network. 

23. (Currently Amended) A machine-readable medium having stored thereon executable 
code which causes a machine to perform a method, the method comprising: 
retrieving a particular POS tag from a tag sequence; 
inputting the particular P OS tag to an RNN network, including 

inputting a first phrase break to a first input of the RNN network, 
inputting a first POS tag from the tag sequence to a second input of the RNN 
network, wherein the first POS tag is associated with the first phrase 
break, 

inputting a second phrase break to a third input of the RNN network, 
inputting a second POS tag from the tag sequence to a fourth input of the RNN 

network, wherein the second POS tag is associated with the second 

phrase break, and wherein the second POS tag is next to the first POS 

tag in the tag sequence, and 
inputting the particular POS tag to a fifth input of the RNN network; 
initializing and inputting a first initial phras e break to a first input of th e RNN 

n e twork, 

initializing and inputting a first initial POS tag to a s e cond input of th e RNN 
n e twork, 
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initializing and inputting a s e cond initial phras e br e ak to a third input of th e 
RNN n e twork, 

inputting th e first POS tag of th e tag sequ e nc e to a fourth input of th e RNN 
network, and 

inputting the second POS tag of th e tag s e qu e nce to a fifth input of th e RNN 
network; 

generating an output phrase break associated with the POS tag, from the RNN 
network; 

retrieving a next POS tag from the tag sequence; and 

repeating inputting the particular POS tag, generating an output phrase break, and 

retrieving a next POS tag, until there are no more POS tags to be processed in 
the tag sequence. 

24. (Canceled) 

25. (Currently Amended) The machine-readable medium of claim 3423, wherein the method 

further comprising: 

inputting the second initial-phrase break to the first input of the RNN network; 
inputting the first -second P OS tag of th e tag s e qu e nc e to the second input of the RNN 
network; 

inputting the output phrase break, previously generated through the RNN network, to 

the third input of the RNN network; 
inputting the second-particular POS tag of th e tag s e qu e nc e to the fourth input of the 

RNN network; 

inputting the next POS tag from th e tag o e qu e nco t o the fifth input of the RNN 
network; and 
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generating a next phrase break associated with the next POS tag through the RNN 
network. 



26. (Currently Amended) The machine-readable medium of claim 3423, wherein the first 
phrase break is generated based on the previously inputted POS tags and previously 
generated phrase breaks, through the RNN network. 
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